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Recently Tracy and Widom conjectured ]29| ] and Johansson proved [ |l7| ] that the expected shape A of the 
semi-standard tableau produced by a random word in k letters is asymptotically the spectrum of a random 
traceless k x k GUE matrix. In this article we give two arguments for this fact. In the first argument, 
we realize the random matrix itself as a quantum random variable on the space of random words, if this 
space is viewed as a quantum state space. In the second argument, we show that the distribution of A 
is asymptotically given by the usual local limit theorem, but the resulting Gaussian is disguised by an 
extra polynomial weight and by reflecting walls. Both arguments more generally apply to an arbitrary 
finite-dimensional representation V of an arbitrary simple Lie algebra g. In the original question, V is the 
defining representation of g = su(fc). 



What is the longest weakly increasing subsequence of a 
long, random string of letters? In the previous sentence, one 
such longest subsequence is "AEEEEEEEFLNNOSTTT". In 
randomly chosen English text, the longest subsequences are 
dominated by the letter 'E', since this letter is the most com- 
mon one. This implies that the length of the longest subse- 
quence has a Gaussian distribution. But if the letters in the 
string are independent with the uniform distribution, a longest 
subsequence will use all of them roughly equally. In this case 
Tracy and Widom established a non-Gaussian distribution for 
the length of a longest subsequence JI^ p9| ]. Their result was 
motivated by recent progress in the study of the longest in- 
creasing subsequence of a random permutation, in particular 
the relations among longest subsequences, random matrices, 
and representation theory 

Tracy and Widom conjectured a generalization which was 
proved by Johansson [|T^ Th. 1.6]: 

Theorem 1 (Johansson). The distribution of the shape of 
a random word as given by the Robinson-Schensted-Knuth 
(RSK) algorithm converges locally to the distribution of the 
spectrum of a random traceless k x k GUE matrix. 

It is a generalization because the first row of the RSK shape 
is the length of the longest weakly increasing subsequence. 
"Traceless GUE" refers to the traceless Gaussian unitary en- 
semble, defined up to normalization as the Gaussian measure 
on traceless kxk Hermitian matrices which is invariant under 
conjugation by unitary matrices. 

In this article we give two arguments for Theorem The 
first argument (Section |l]) is based on quantum statistics: it 
identifies the random matrix itself as a quantum random vari- 
able on the space of random words viewed as a quantum state 
space. The GUE ensemble then appears in the limit by a quan- 
tum central limit theorem. The second argument (Section 0) 



is based on classical statistics: it identifies the density formula 

CYl{\a-\bfe-^^^"^d\ (1) 



a<b 



for the distribution of the spectrum A of a GUE matrix [ |23| ] 
as a disguised classical central limit. (Here C is a constant 
that depends on k but not A.) The classical argument is rigor- 
ous and it establishes a precise estimate. The quantum argu- 
ment can be read rigorously or non-rigorously, depending on 
whether the reader accepts Conjecture eI either way it is less 
precise than the statement of TheoremlU Non-rigorously, it 
establishes convergence in distribution. Rigorously it estab- 
lishes convergence of certain moments, but not enough mo- 
ments to imply convergence in distribution. Nonetheless we 
prefer the quantum argument since it is less traditional. (But 
see Biane [§, ^ 0] for closely related results.) 

In both arguments, it is important to identify the vector 
space of traceless Hermitian matrices with the Lie algebra 
5u(fc) and an alphabet with k letters with the standard basis 
of the defining representation V — . Both arguments then 
generalize to an arbitrary finite-dimensional unitary represen- 
tation of of a compact simple Lie algebra g. The conclusion 
is a relation between random words in a weight basis of V and 
a natural Gaussian measure on g*, the vector space dual of g. 
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In this section we will express certain classical random vari- 
ables in terms of simpler quantum random variables. The 
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main object in our argument was also considered from the 
converse view by Biane ^] . 

We refer the reader to Sakurai [|7[ §3] for basic notions of 
quantum statistics, in particular mixed states, which are also 
commonly called density matrices or density operators. In 
the context of operator algebras, mixed states are called states 
I p^ ] or normal states [p^, depending on the desired strength 
of the formalism. 

The RSK algorithm is (in one version) a function that takes 
as input a word of length N in the alphabet [k] = {I, ...,k} 
and produces as output a pair of tableaux {P, Q) of shape A, 
where 



A = (Ai,A2 



, Afc) h N 



is a partition of into non-increasing, non-negative integers 
1 ^ , §7.11]. The partition is considered synonymous with 
its Young diagram, meaning its horizontal histogram. The 
tableau P is semi-standard and is called the insertion tableau, 
while the tableau Q is standard and is called the recording 
tableau. Given the uniform distribution on the set of words 
[k]^ , we can view the shape A as a random variable Arsk- 
Finally, given a partition A, it will sometimes be convenient to 
subtract the mean from each part to form a "partition of 0": 



A = (Ai 



N 
~k' 



\2- 



N 



N 



We do not need the precise definition of the RSK algorithm 
in this section, merely one of its important properties: It is a 
combinatorial model for the direct sum decomposition of the 
representation V'^^ of the Lie algebra u{k) (or the Lie group 
U(fc) or GL(fc, C)), where V = C'' is the defining represen- 
tation [eM §A2]. This representation decomposes as 



(2) 



XhN 



where is the irreducible representation of u(fc) of shape 
A and R\ is the irreducible representation of the symmetric 
group 5jv of shape A. For any given A = Arsk, the set of 
associated insertion tableaux P indexes a basis of V\, while 
the set of recording tableaux Q indexes a basis of R\. In 
particular. 



dim Rx «) Vx 



nx, 



where nx is the number of words that have shape A — Arsk- 
Finally, as a representation of the Lie subalgebra su(fc), Vx is 
unchanged if we add a constant to each component of A. The 
convention is to call it V^, the representation of highest weight 

is A. 

We can view the vector space V'^^ as a quantum state 
space H of some quantum system Q with [k]^ as an orthonor- 
mal basis. The maximum-entropy state (or tracial state) p of 
Q is then realized by the uniform distribution on [k]^ , as well 
as by the uniform distribution on any other orthonormal basis. 
At the same time, an arbitrary orthogonal direct sum decom- 
position 



of Ti. can be interpreted as a random variable taking values in 
the set of summands. Relative to the state p, the probability 
of a given summand Hi is the ratio (dimTCi) / {diuiTC). In 
particular, the direct sum decomposition in equation (||) ex- 
presses a random variable Aqm ~ A. The previous paragraph 
tells us that Aqm = Arsk, meaning that they have the same 
distribution. 



1.1 The case k = 2 and spin 1/2 particles 

As a concrete example, we consider the physically realiz- 
able case fc = 2. In this case V is the familiar state space of 
a spin 1/2 particle, and the action of SU(2) is the projective 
action of the spatial rotation group SO (3). We will use the 
alphabet {f, J,} rather than {1, 2} as a basis of V. The space 

V admits angular momentum operators J^, Jy, and which 
satisfy the commutation relations 

[Jx7 ^y] — ^Jz {^y^ '-^ z\ — ^'-^x \Jz^ Jx\ — iJy. (3) 

The operators J^, Jy, and Jz are a basis of i ■ su(2), by which 
we mean the image of su(2) in 5 [(2, C) under multiplication 
by i. Thus these are just the usual commutation relations in 
the Lie algebra su(2) up to a factor of i. The tracial state on 

V is the mixture 



P 



IT)C 



U)(il 



(4) 



Note that probabilistic mixtures of states should not be con- 
fused with quantum superpositions. A superposition of | f ) 
and I i ) is another vector in V and cannot be invariant under 
rotations. By contrast the mixed state p is SU(2)-invariant. 

The vector space V®^ is then the state space of N such par- 
ticles. The tth particle has angular momentum operators Jx\ 
Jy\ and ji*'. By equation (^ and the isotropy of p, each of 
these operators is a centered Bernoulli random variable with 
equally likely values i and — i. Since the three operators for 
any fixed t do not commute, the corresponding random vari- 
ables cannot be simultaneously observed. The sums of these 
operators form the total angular momentum. 



(5) 



for each a e {a;, y, z}. Each operator Jq, is a centered bino- 
mial random variable because the terms are independent com- 
muting Bernoulli variables. The three operators Jx, Jy, and 
Jz do not commute either, but rather satisfy the same commu- 
tation relations in equation (|^), since they express the natural 
(diagonal) action of su(2) on V®^ . Finally, the total angular 
momentum 



J' 



Jx ^ Jy ' 



Ji 



is diagonalized by the direct sum decomposition in equa- 
tion (O). In a summand with weight A, its eigenvalue is 



J' 



Ai(Ai + l)|V). 
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Thus if L = (Li, L2) is the shape-valued operator that mea- 
sures Aqm, it is related to by 

J2 = Li(Li + l). 

If we define the scaled angular momentum operators 

J a - 



N 

with a E {x, y, z}, then by the above reasoning, these three 
operators become commuting Gaussian random variables in 
the limit ^ cx). By rotational symmetry they are also in- 
dependent and identically distributed (i.i.d.). This behavior 
of the total angular momentum of independent random spins 
can be witnessed in nuclear magnetic resonance experiments, 
among other places. Using these operators we may form a 
matrix 



M = 



-J. 



In the limit N 00, M becomes a traceless GUE matrix! 
(The normalization is also consistent with Mehta [p3|].) For 
finite N, the determinant of AI must be interpreted carefully 
because its entries do not commute. If we define it by averag- 
ing over orderings of the entries. 
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detM = -(M11M22 + M22M11 - M12M21 - M21M12) 



then it turns out that 



detM = -J\ 



It follows that 



lim ^5= = Ague, 



where Ague is a random variable representing the spectrum 
A of a traceless GUE matrix. This is precisely Theorem |l] for 

k = 2. 



1.2 The general case 



The argument in Section 1 . 1 generalizes with only natural 
changes to all values of k. The defining representation V of 
su{k) has a basis of states 

|l),|2),...,|fc). 

The elements of i su(fc) may be viewed as generalized angular 
momentum operators. We define two matrices of operators 
A and B whose entries linearly span i ■ su(fc). (Note that 
the diagonal entries are not linearly independent.) Let Eab G 
sl{k, C) be the elementary matrix whose non-zero entry is 



{Eab)ab 



1. 



Then the entries of A and B are 

1 i 

Aab ^ ^{Eab + Eba) Bab ^ ^{Eba - Eab) {a b) 

Aaa = Eaa — -^I Baa — 0. 

Let AI be the matrix of operators 

Mab = Aab + iBab- 

Each entry Aab and Bab is a real-valued measurement op- 
erator. Relative to the tracial state on V, Aab for a ^ h takes 
each of the values 1/2 and —1/2 with probability 1/fc and 
otherwise has the value 0. The same is true of Bab- The mea- 
surement Aaa takes the value (fc — l)/fc with probability 1/fc 
and otherwise takes the value —1/k. The operator Mab may 
appear to be a complex-valued measurement whose real and 
imaginary parts have these distributions, but this is not quite 
true. An operator can only be interpreted as a complex-valued 
measurement if it is a normal operator, defined as an opera- 
tor that commutes with its adjoint, or equivalently an operator 
whose self-adjoint and anti-self-adjoint parts commute. When 
a ^ b, Mab is not a normal operator; it represents a complex 
random variable whose real and imaginary parts are not si- 
multaneously observable. 

For words in [fc]^, we consider the standard (additive) ac- 
tion of su(fc) on V®^ . Equation (^ gives us a shape-valued 
operator L which has the eigenvalue Aqm = A on the sum- 
mand V\ ® R\. The operator L can be realized algebraically 
using the characteristic polynomial of M, thought of as a 
polynomial-valued operator: 



C{x) = det {xl - M) 

= {x- Li){x - L2) . 



{x ~ Lk) + c{x,L), (6) 



where c{x^ L) is a polynomial of total degree at most fc — 1. 
As before, each term of the determinant is defined by aver- 
aging over the k\ orderings of its factors. The left side of 
equation (^ is a disguised version of the composition -0 o tt in 
the proof of Harish-Chandra's Theorem given by Humphreys 
[ [1^ , §23.3], while the leading term on the right side is a dis- 
guised version of the map 6. Here we are applying both maps 
to the coefficients of the ordinary characteristic polynomial of 
an element in su(fc); in the context of Humphreys, each co- 
efficient is a particular SU(fc)-invariant polynomial on su(fc). 
As Humphreys explains, the maps ipon and agree in the top 
degree, which is exactly what equation (||) asserts. (See Ok- 
ounkov and Olshanski |26| for an analysis of the correction 
term c{x, L).) 

Each coefficient of C{x) lies in the center of C/(su(fc)) and 
is a natural generalization of the Casimir operator ,P in the 
case k — 2. The coefficients are sometimes called elementary 
generalized Casimir operators. 

Assuming the tracial state on 1/®^, each measurement Aab 
and Bab is a sum of bounded, centered i.i.d. random variables. 
If we define 



M = A + iB ■ 



M, 
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then the entries commute in the limit N ^ oo and M be- 
comes a traceless GUE matrix with standard normalization. 
The term c{x, L) in equation also disappears in this limit 
because its degree is too low. The equation thus tells us that 



and let 



N 



A 



t=i 



1.3 What did we prove? 



One important step in the argument of this section is not 
completely rigorous. Unquestionably each scaled angular mo- 
mentum operator Jq, Aab, or Bab converges to a Gaussian 
random variable by the classical central limit theorem. But 
we do not know that a polynomial in these variables, for ex- 
ample or JxJyJz, converges in distribution to the corre- 
sponding polynomial of Gaussian variables. We cannot ap- 
peal to the classical multivariate central limit theorem for non- 
commuting variables, even if they do commute in the limit. 
There are also several quantum limit theorems in the litera- 
ture; one of the most general ones is due to Goderis, Verbeure, 
and Vets [Q. But these results are apparently not sufficiently 
strong either 

As a stop-gap we will conjecture the quantum central limit 
theorem that we need, and we will prove a weak version of the 
conjecture. The conjecture is naturally stated in terms of C*- 
algebras and von Neumann algebras [|l8, 19 1, which provide 
a rigorous language for quantum statistics. In this language, 
a non-commutative probability space is defined as a von Neu- 
mann algebra Ai with a normal state p. A state is defined as 
a dual vector on M, continuous in the norm topology, with 
the interpretation that for a self-adjoint element A, p{A) is the 
expected value of the random variable given by A; the state is 
normal if it is continuous in the weak topology as well. The 
reader who is uninterested in operator algebras can take 

X = Mfc = 0[(fc,C), 

the vector space of kx k matrices. A normal state p on Aik is 
any dual vector whose matrix is Hermitian and positive semi- 
definite and has trace 1 ; this is exactly the definition in physics 
of a finite density matrix §3]. In particular, the tracial state 
is defined by 

p(A) = iTrA 

Given two quantum systems with von Neumann algebras 
M and JV, the joint system has the algebra M <E) M, using 
the tensor product in the category of von Neumann algebras. 
Two normal states p and uj onM. and M form an independent 
joint state p ® lo. Given self-adjoint operators A G and 
B e M, the operators A® I and I (g) B represent independent 
measurements in the joint system. If A E A4 is self-adjoint, 
let 



Thus A expresses the scaled sum of N i.i.d. random variables. 

In formulating a multivariate quantum central limit theo- 
rem, three issues arise because of non-commutativity. First, 
the theorem must be a statement about the distribution of 
non-commutative polynomials p g C{Ai,... ,Ak), but a 
Gaussian central limit would describe the distribution of com- 
muting variables. Second, a general polynomial expression 
p{Ai, ... ,Ak) need not be a self-adjoint operator, even if the 
variables are self-adjoint. Thus we will assume that p is a 
self-adjoint polynomial, meaning that it is invariant under the 
anti-linear anti-involution 



C{A, 



that conjugates each coefficient and reverses the order of each 
term. Third, if we define the covariance matrix of the variables 
Ai,... ,Ak as 

Kab = p{AaAb), 

it may not be symmetric. When this happens the behavior of 
the central limit is genuinely different from the classical case; 
it has been studied in Reference |llj But in the case that we 
need (namely, when p is tracial), the covariance matrix is sym- 
metric. In this case we expect that the limiting distribution of 
p only depends on its commutative image p G C[Ai, . . . ,Ak], 
thereby resolving the first issue. 

Conjecture 2. Let {A4 , p) be a quantum probability space, 
and let Ai, . . . , Ak be self-adjoint elements with mean and 
a symmetric covariance matrix. Let p G C(Ai , . . . , A^) be a 
self-adjoint non- commutative polynomial in k variables. Then 



N 



lim p{Ai 



,Ak)^piXi 



(8)71(8) gM 



where Xi , . . . , Xk are classical Gaussian random variables 
with covariance matrix 

E[XaXb]^ p{AaAb). 

If we let p be a coefficient of the polynomial C{x) from 
Section [l|. Conjecture ^ then implies Theorem [l|. It may be 
possible to be reverse this reasoning and use Theorem |l] to 
prove Conjecture || for arbitrary p, at least when is a ma- 
trix algebra and p is the tracial state. But a satisfactory proof 
would hold for arbitrary quantum probability spaces. 

Our Theorem H below establishes convergence of moments 
in the context of Conjecture ^ Since this theorem is al- 
most entirely algebraic, we do not need the full structure of 
a von Neumann algebra. Rather we let Ai be an arbitrary *- 
algebra, meaning a unital ring over the complex numbers with 
an anti-linear anti-involution *. A state p on a ^-algebra is a 
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^-invariant dual vector such that p{I) = 1 and p{A^) > for 
every self-adjoint A E Ai. Finally the nth moment 

is defined whether or not A is self-adjoint. (Recall that that 
a non-self-adjoint operator may be written as A + iB, where 
A and B are self-adjoint. Consequently it may be interepreted 
as a complex-valued quantum random variable whose real and 
imaginary parts are not simultaneously observable. Our defi- 
nition of moments is consistent with this interpretation.) 

Theorem 3. Suppose that M. a '^-algebra with a state p. Let 
Ai, . . . , Ak be self-adjoint elements with mean 0. Suppose 
that for all a and b, 

p{AaAb) = p{AbAa). 

If p € C(Ai, .... Ai;) is a non-commutative polynomial, then 
lim 7„(p(Ii, . . . ,Ife)) = ln{p{Xi, . . . ,Xk)), 

where Xi , . . . , Xk are classical centered Gaussian random 
variables with covariance matrix 

E[XaXb]= p{AaAb), 

and ^n{A) is the nth moment of A. 

Proof. Since the assertion is claimed for every polynomial, it 
suffices to prove that the expectation 

converges. To show convergence of expectation we may let p 
be a monomial. Indeed the monomial 

p{Ai,... ,Ak)^ AiA2...Ak 

will do, since some of the factors may be equal. 
Expanding the expression 

7 = p®^(Ii,... 

using the definition of Aa, it has a term for each function 
from [k] to [N]: 

4,:lk]-~,lN] 



Let 



be an individual term in this expansion. Since we are comput- 
ing the expectation with respect to the product state, we can 
arrange the factors with respect to [N] rather than [k]: 

i,=x-^i^ n ^( n ^'^y 

te[Af] 0(a)=t 



In this form it is clear that 

70 = 

if there is a f such that </'^^(t) has one element. At the same 
time, if S is the set of those functions whose images have 
fewer than k /2 elements, then 

lim ^ 7^ = 



AT— >oo 



because 



1^1 = o(X^I^). 



In other words, there are sufficiently few such functions that 
they are negligible in the limit. What is left is the set of func- 
tions that are exactly 2-to-l, which only exist when k is even. 
Thus if M is the set of perfect matchings of [fc], then 



meM {a,b)em 



+ o{l) 

(fc-1)!! ^ H p{AaAb)+o{l), 

meM (a,b)em 



where 



(A:-l)!! = (A:-l)(/s-3)---5-3-l 



is the odd factorial function. In the limit 7 exactly matches 
the corresponding expectation 

E[Xi . . . Xk] ^ (k - iy.\ Y[ E[XaXi,] 

meM (a,b)£m 

of the classical variables Xi, . . . , Xk- □ 

In relation to Theorem [l|. Theorem || says that if 

Cx{x) = (a; - Ai)(a: - A2) . . . (x - Xk) 

for a partition A, then the moments of (x) converge to 

the moments of C\^^j-i^{x). In other words, the "moments 
of the moments" of Arsk converge after scaling to those of 
Ague- Unfortunately, when k > 2 the tail of the distribition 
of Cague i^) is too thick for convergence of moments to imply 
convergence in distribution. 

2. LOCAL LIMITS 

The argument of this section and its generalization below 
(Theorem^) are very similar to a result of Biane [^. It was 
also found by Grinstead | ]l4| ] in the case k = 2, and it is related 
to some results of Grabiner [jl^ . 

The idea of the argument is that, if we name the dimensions 
appearing in equation (S), 



fx = dim Rx 



dimVx nx = fxdx, 
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the quantity f\ can be considered in the context of the k-ballot 
problem. Suppose N voters vote sequentially for an ordered 
list of k candidates. In how many ways can they cast their 
votes so that the ath candidate is never ahead of the 6th candi- 
date for a > b, and at the end the ath candidate has votes 
for every a? Such a sequence of votes is a ballot sequence 
of shape A and there are fx of them Prop. 7.10.3]. In 
this context of the RSK algorithm, f\ is the number of stan- 
dard tableaux of shape A. If the tth entry of such a tableau 
is in row a, we can say that the tth voter votes for candidate 
a. This establishes a bijection between standard tableaux and 
ballot sequences. That f\ is the number of standard tableaux 
of shape A can also be seen directly from the representation 
theory of 0[(fc): The generalized Clebsch-Gordan rule states 
that 



A'=A+n 

where the sum is over shapes A' that are obtained from A by 
adding a single box. Thus the multiplicity of V\ in V'^^ is 
the number of increasing chains of partitions from the empty 
partition to the partition A. Such a chain is equivalent to a 
standard tableau of A by assigning f to a box if it appears at 
step t. 

In this formulation, the number f\ may be computed by 
the reflection principle []lo[|, which is also a disguised version 
of the Weyl character formula 1 1 3 1 . Recognizing the set of 
partitions as a subset of Z'^, there is an action of the symmetric 
group Sk on Z*^ given by permuting coordinates. A chain of 
partitions is then a lattice path in Z'^ that happens to stay in 
the cone of partitions. Here a valid lattice path is one which 
increases one coordinate by one at each step. Consider the 
partition 

6 = {k-l,k-2,k-3... ,2,1,0) 

and let m\ for any A € Z*^ be the number of lattice paths from 
the origin to A. The reflection principle shows that 



/a = E (-1) 



(7) 



Equation says that the number of ballot sequences from 
to A is the alternating sum of unrestricted lattice paths from a 
set of image points of the form ^{S) — S to X. Figure |l] shows 
an example of the principle when fc = 3; in the figure, parti- 
tions A are replaced by A to obtain walks in a 2-dimensional 
lattice. 

Since the numbers mx are defined by walks with the same 
k possible steps at each time t, they can be approximated by 
the local central limit theorem: 

mx ^ Ce-'^^'/^JV 

Here and below we assume that C is a constant depending 
only on N and fc, and we use the notation 



a=l 




Figure 1 : Paths from A to and an image point in the weight 
lattice of su(3).. 



If the approximation (g) were robust with respect to local finite 
differences, then by the reflection principle it would give us an 
estimate for fx- If it were also robust under amplification by 
a polynomial in A, it would give us an estimate for 

nx = f\dx, 

since the Weyl dimension formula JT^ §24.3] says that 

Aq - Ab + a - 5 



dx = X{ 

a>b 



a — b 



(9) 



is a polynomial in A. Both of these refinements of the lo- 
cal central limit theorem are true for arbitrary bounded lattice 
walks. To state the theorem, we introduce a few definitions. 

A finite difference operator _D is a linear operator on func- 
tions 

p-.R'' 

defined by a finite sum 

Dp{v) = ^ctf{v + vt) 
t 

for some constants {ct} and some vectors {vt}. The degree 
a of D is the minimum degree of a polynomial p such that 
Dp 7^ 0. If L C M'^ is a lattice, the determinant 

detL = Vol M'^/L 

is defined as the volume of the quotient space, or equivalently 
as the determinant of a positive basis for L. 

Theorem 4. Let X be a bounded, mean random variable 
taking values in z + L for some lattice L C M*^ and some 
vector z G M''. Assume that L is the thinnest such lattice for 
the given X. Let 

N 
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denote the sum of N independent copies of X. Let 

p{v) = P[X' = v] 



q{v) 



detL 



^-K-\v,v)/2N 



where k is the covariance form of X. Then for every finite 
difference operator D of degree a and for every integer b > 0, 

lim iV('''-^+'^)/2|t;|^D(p - q){v) = 

uniformly for v G Nz + L. 

Lawler ||l|,Th. 1.2.1] proves a special case of Theorem W 
in which a and b are or 2, X has the uniform distribution 
among nearest-neighbor steps in Z*^, and D has a restricted 
form. However, the proof actually establishes Theorem ^ in 
its almost its full generality, requiring only that b be even. The 
conclusion for b odd follows by taking the geometric mean of 
the formulas for 6—1 and 6+1. 

Since f\ is given by the hook-length formula [ p8| , Cor. 
7.21.6], we can also prove Theorem Q in this special case using 
Stirling's approximation [|l7| §4]. Such a special argument is 
analogous to the special argument for the Laplace-de Moivre 
theorem, which is the simplest case of the usual central limit 
theorem. But it is not enough for our later generalization. The- 
orem |[ 



Corollary 5. IfXhN, then 



lim N''^^{k-^nx^cY[{Xa^Xb)^e- 



k\^/2N\ 



a<b 



uniformly in A. 

Proof. We apply Theorem ^. First, we change A from a sub- 
script to an argument in certain quantities that depend on it 
(and implicitly on N): 

/(A) = /a m[\) = mx n{X) = nx, 

where dependence on TV is implicit in the notation. Let L = 
A be the set of all centered partitions A (the weight lattice). 
Define a finite difference operator D by 

DpW= 5](-irp(A + ?-a(?)) 



so that 



Dm{X) = /(A) 



by equation (|^). The two important properties of the op- 
erator D are first, that it is antisymmetric under the Weyl 
group Sk after translation by 6, and second, that it has degree 
k{k — 1) /2. The degree of D follows from a factorization that 
appears in proofs of the Weyl dimension formula IIS, §24.3]: 



D 



n 



where <I>_|- is the set of positive roots of su(fc) and 
Z?„(A) ^p(X)-p(X + a). 

Each Da has degree 1 and there are fc(fc — l)/2of them. Note 
that the only antisymmetric polynomial of degree fc(fc — l)/2 
is 

a<b a<b 

When N is large, 

because in the limit D becomes an antisymmetric differential 
operator of degree k{k — l)/2. When applied to a symmet- 
ric Gaussian, it produces an anti-symmetric polynomial factor 
of degree k{k — l)/2. The polynomial A is the only choice 
for this factor up to scale. Thus the operator D explains one 
factor of A in the statement of the corollary. The other factor 
is given by d\, which is also proportional to A in the limit as 
A h iV goes to oo. Theorem Jjthen establishes the stated ap- 
proximation for nx, where D and each contribute a factor 
of A. □ 

Corollary ^ is evidently the precise statement of Theorem |l]. 

3. GENERALIZATIONS 

The first way that we can generalize Theorem |l] that is 
that we can replace the representation V of su(fc) by some 
other finite-dimensional representation W . In general a ten- 
sor power of such a representation decomposes as 



W®"" ^^Tx,N®Vx, 



(10) 



AhO 



where each Ta.tv is a vector space on which su(fc) acts triv- 
ially. (In this generality it does not make sense to make A a 
partition of N or any other particular integer, so we take it to 
be a highest weight, or a partition of 0.) The space Ta.w is a 
representation of the symmetric group Sn, but it is not usu- 
ally irreducible, not even when W is. Assuming the a state on 
W'^^ which is invariant under the action of su(fc), we may as 
before use equation (|l^ to define a quantum random variable 
Aqm- 

It is less trivial to define a classical counterpart Arsk, or 
even the space of words on which it is defined. If W is ir- 
reducible, we can model it as a summand of V®^ for some 
I. More precisely, we choose a partition fj, \- t such that 
W ^ Vji, and we choose a specific standard tableau Qw 
of shape /i. Then the set S C [kY of words with recording 
tableau Qw indexes a basis of W. The set S can be inter- 
preted as a "syllabic alphabet", in the sense that a word w 
of length N over the alphabet S is simultaneously a word of 
length £N over the alphabet [k] . Remarkably, the RSK algo- 
rithm is compatible with this dual interpretation; If we define 



a6<E>4 
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the shape X of w G by spelling it out in [k] and taking 
the usual shape, then once again 

dimTA^AT ®V\=nx, 

where n\ is the number of words w with shape A. (One way 
to argue this fact is with the theory of Littelmann paths; see 
below. Syllabic expansion corresponds to concatenation of 
paths.) 

For example, if fc = 2, then has a summand W = 
V2 isomorphic to the adjoint representation of su(2). As it 
happens, this summand occurs only once. If we take left and 
right parentheses { ) . ( } as the alphabet for the basis of V 
rather than {1,2}, then the first component Ai of the centered 
shape of a string is half the number of unmatched parentheses. 
For example, Ai = 1 for the string 



The alphabet S for the representation W is the set of three 
pairs of parentheses { (( , )( , )) } other than the two that 
match each other If we rename this alphabet { ( , | , ) }, then 
one can check that the only words that match completely are 
those that form nested complete "bra-kets": 



( I I ,( I ), I ) 



A general string will have a maximal substring of this form, as 
well as fragments consisting of unmatched "bras", unmatched 
"kets", and unbracketed separators: 

The statistic Ai is then the number of these fragments. For 
example, Ai = 2 for the string 



since there are four unmatched parentheses if the bra-kets are 
expanded into parentheses: 




More generally still, we can let W be any non-trivial, finite- 
dimensional, unitary representation of any compact simple 
Lie algebra g. (We say that a Lie algebra is compact if it in- 
tegrates to a compact Lie group.) Once again there is a direct 
sum decomposition 



where A is the weight lattice of g and Vx is the irreducible 
representation of highest weight A. As before this decompo- 
sition defines a quantum random variable Aqm if we assume 
the tracial state p on W. 



If W is irreducible, the theory of Littelmann paths then pro- 
vides a satisfactory combinatorial counterpart Alp with the 
same distribution as Aqm @]- If W V^, then we can 
apply the Littelmann lowering operators to some fixed domi- 
nant path from the origin to fx. There is a natural bijection 
between the resulting set of paths P{W) and a basis of V^. 
Moreover, a word w in forms a longer path j{w) given by 
concatenating letters. If we apply Littelmann raising opera- 
tors as many times as possible to 7(w), the result is a high- 
est weight A. Assuming the uniform distribution on P^, this 
weight defines a random variable random variable Alp = A. 
Note that Alp depends on p^, although its distribution does 
not. 

Although the abstract setting of Littelmann paths looks 
quite different from the Robinson-Schensted-Knuth algo- 
rithm, it is in fact a strict generalization [|T|. Briefly, if 
W — V is the defining representation of 5u{k) and is a 
straight line segment, then every element of P^ is a straight 
line segment, and these segments are naturally enumerated by 
the integers 1, . . . ,k. The highest weight Alp of a word w 
coincides with the centered shape of the tableau produced by 
a dual RSK algorithm defined using column insertion. (The 
standard RSK algorithm uses row insertion.) By one of the 
symmetries of the RSK algorithm [|8], Cor. AL2.11], this 
shape is the same as the row-insertion shape of the reverse 
word w* . Thus 



Alp(w) = Arsk(w*) Alp = A 



RSK- 



Finally, if W is not irreducible, then we can choose a sep- 
arate alphabet for each summand in a direct-sum decomposi- 
tion. For example, if 

for some dominant weight /i, then we can let the alphabet be 
the disjoint union of two copies of the same alphabet P^, a 
"red" copy and a "blue" copy. The fundamental properties of 
Littelmann paths imply that in all cases, Aqm = Alp, where 



P[Xqm = A] = 



(dimVK)^ ■ 



What is the counterpart to Ague in this context? Taking 
Section |l] as a guide, each element A & i ■ q defines a real- 
valued random variable on the quantum space {W, p). These 
variables have the covariance form 



Kw{A,B) ^ p{AB) 



dim 



All together they can be taken as 1 -dimensional projections of 
a quantum random variable x e i-Q*. Instead of the spectrum, 
we can consider the orbit of ix under the co-adjoint action of 
g oni ■ g*. By standard representation theory, ix is conjugate 
to a unique weight 

Ae £c r Cfl*, 

where C is a Weyl chamber in ()*, the dual space to a Cartan 
subalgebra of g. If we assume a Gaussian distribution pw on 
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i ■ g* with covariance matrix k\y, the corresponding distribu- 
tion for the weight A is 

ew{X)dX = C Yl a{Xfe-''^v(^^^^'>/^dX, 

where as before is the set of positive roots in g. This dis- 
tribution can be derived in the same way as equation ([l|). If 
g = 5p(2n) (the compact form of sp(2n, C)), it is the Gaus- 
sian symplectic ensemble (GSE). But if g = 5o{n), it is the 
Gaussian antisymmetric ensemble (GAE). Since symmetric 
matrices do not form a Lie algebra, the Gaussian orthogonal 
ensemble (GOE) would require some yet more general model. 
Finally we can state the general theorem. 

Theorem 6. Let W be a non-trivial, finite-dimensional, uni- 
tary representation of a compact simple Lie algebra g of rank 
r. Let n\ be the dimension of the isotypic summand ofW"^^ 
of highest weight A. Then 

uniformly in A. 

The arguments of Sections [l| and ^ both generalize in a 
straightforward way to proofs of Theorem |[ As before, Sec- 
tion n establishes a weak version of it at a rigorous level. We 
also comment that the tautological matrix M of Section |l] 
should be replaced by a certain i ■ g*-valued measurement op- 
erator 

M e g* «)g 

acting on W^^ . As a tensor in g* (g) g, M is again tautologi- 
cal; it comes from the identity linear transformation from g to 
itself. 

Remark. The Lie algebra picture of Theorem ^ suggests an- 
other interpretation which is dual to that of Section nl and in 
another sense dual to that of Section |[ If G is a compact, 
simple Lie group with Lie algebra g and Vt^ is a unitary rep- 
resentation of G, then the absolute value of the character xw 
of W has a local maximum at 1 e G. When N is large, 
the character Xw °f W^®^ is approximately a Gaussian in 
a neighborhood of L If we inflate G by a factor of VlV, it 
converges to g, and multiplication on G converges to addition 
on g. The character Xw converges to a limit on g, namely 
the Fourier transform of the Gaussian distribution fiw on g* 
defined above. This intermediate picture led the author from 
Section ^ to Section 

3.1 Things out of reach 

When g = 5u(fc), Theorem ^ can be interpreted as a limit 
distribution result for the shape Arsk of words with various 
interesting distributions. For example, if each letter of the al- 
phabet for the representation V2 of su(2) is expanded into a 
pair of letters in the alphabet {t, i}, then the distribution p 



on expanded words is determined by its correlations for di- 
graphs (adjacent pairs of letters). In this case the correlation 
between the tth and t + 1st letter depends on whether t is odd 
or even. But random words associated with representations 
such as V2 V3 do not exhibit such irregularities. 

Especially when k = 2, these distributions resemble distri- 
butions given by doubly stochastic Markov chains. In other 
words, the first letter wi of a random word w G [k]^ has the 
uniform distribution. Each subsequent letter depends on the 
immediate predecessor (but not on earlier letters) according 
to a Markov matrix M: 

P[wt+i = a I Wf = 6] = Mat- 

Here M is chosen so that every letter has the uniform distri- 
bution if the first one does. 

What is the asymptotic distribution of the shape Arsk of 
a random word w G [k]^ generated by a doubly stochastic 
Markov matrix M? Non-rigorously we expect it to have the 
form 

GP(A)e-'=^'/2.Ar^ 

Here P is some polynomial (or at least some function which 
is asymptotically polynomial) and v is the variance per letter 
of w. The variance ti is defined by the formula 




kP[wo = wt]- I 
k-1 



t— — oo 



using a bi-infinite word w generated by M. 

We have conducted computer experiments with different 
choices of M with 2- and 3-letter alphabets [^0[]. Figure ^ 
shows the distribution of Ai for 400,000 words generated by 
each of the following four Markov matrices M: 

1 /I 2 1\ 1 /I 1 1\ 

41^0 1 3yl Hill/ 

l/30l\ i/310\ 
G+ = -130 G_ = - 031 

4 \0 1 3/ ^[103) 

The lengths of the words are 1620, 3420, 1140, and 1140 in 
the four respective cases. These lengths were chosen so that 
the four types of words would have the same total variance 
(ignoring boundary effects). The experiments indicate that the 
distribution of Ai (the centered length of the longest weakly 
increasing subsequence) in the asymmetric distribution A is 
genuinely different from the referent uniform distribution F. 
The lower median value of Ai in this case does not disappear 
as the words grow longer. It also cannot be explained as a 
maladjusted variance, because at the other end the tail of A 
eventually overtakes the tail of F. On the other hand, the dis- 
tribution for the cyclic Markov chains G+ and G_ do appear 
to converge to the distribution for F. Their symmetry implies 
that the longest weakly increasing subsequence sees the same 
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Ai 

Figure 2: Distribution of Ai, the centered length of the longest 
increasing subsequence, for words generated by four different 
Markov chains, 400,000 trials each. 



3.2 Infinite matrices 

The most interesting case of Theorem |l] to consider (indeed 
the case that motivated the result) is the limit fc — > cx). We 
have no firm results about this limit, but we can propose a 
model of it that may be important. Our model might be related 
to the semi-infinite wedge space model of Okounkov [^. 
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Figure 3: The hyperfinite II i factor as a matrix algebra 
over itself. 



fluctuations in the transition 1 —> 2 as it does for the transi- 
tion 2^3, which is apparently enough to produce the same 
distribution. 

Conjecture 7. Let M be an indecomposable, doubly stochas- 
tic matrix such that 

where k + 1 = 1. Then the distribution of the shape of a 
word of length N generated by M converges locally to the 
distribution of the spectrum of a traceless k x k GUE matrix. 

Conjecture ^ can be generaUzed further by considering 
other cyclically symmetric, translation-invariant measures on 
words whose correlations decay sufficiently quickly. 

Problem 8. Let V — C'' and let p be the state on F**" ex- 
tending a distribution on [fc]^ generated by a doubly stochas- 
tic Markov chain. What is the limiting distribution of Aqm •'' 

Problem ^ is really a statistical mechanics question con- 
cerning a quantum spin chain with certain nearest-neighbor 
interactions. It cannot be stated in terms of Arsk because 
there is no reason to expect that Arsk = Aqm in this general- 
ity. Yet more generally, we can ask about the behavior of Aqm 
for an arbitrary nearest-neighbor interaction that produces the 
tracial state when restricted to a single site. 

Problem 9. What is the distribution of the longest weakly in- 
creasing circular subword of a circular word w G [A:]^ 

In Problem |[ we assume that both [fc] and [N] are circu- 
larly ordered. We do not know if there is a suitable circular 
generalization of the RSK algorithm. 



Consider the Hilbert space H = L^([0, 1]). For every k, the 
matrix algebra Mk acts on H by taking 

if / is supported on [^^, f], and 

if / vanishes on [^^, f]- The weak-operator closure of all 
of these algebra actions is a von Neumann algebra Ai called 
the hyperfinite IIi factor [ |l9| , §12.2]. For every fc, is a 
k X k matrix algebra over itself (Figure Thus it could be 
generally important in random matrix theory. 

In this case we are interested in the Lie algebra structure 
of Ai (in addition to its topologies), making A4 an infinite- 
dimensional analogue of 0[(fc, C). We are also interested in 
the tracial state p, defined as the continuous extension of the 
tracial state on each A4k- It is a normal state. It is also a 
model of the uniform measure on the interval [0, 1]. Finally 
we define Ai to be the kernel of p, analogous to the space 
5l(fc, C) of traceless matrices. 

As a Lie algebra, Ai acts on 7i®^ for every N. This ac- 
tion commutes with the action of the symmetric group Sn on 
jf^N gjygjj permuting tensor factors. There is a direct-sum 
decomposition 

H«^ = 0Ha, (11) 

AhJV 

where Hx is, as a representation of Sn, tiie isotypic summand 
of type R\. Each of these representations has a measure- 
theoretic dimension defined using p®^ on 7i®^ (in which 
Ti embeds by the usual Leibniz rule for Lie algebra actions): 

f2 
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pS5N ^jgQ expresses the uniform measure on 



Thus equation ( |lT| ) is a quantum statistics model for the 
Plancherel measure on the symmetric group. For each N, it 
defines a quantum random variable Aqm N 
The state 

[0,1]^, i.e., the process of choosing a "word" of N random 
points in the unit interval. The usual RSK algorithm is de- 
fined for such words. Since the letter of the word are distinct 
almost surely, and since the RSK algorithm depends only on 
the order of the letters and not their values, it defines a random 
variable Xrsk equivalent to the shape of a random permuta- 
tion. Its distribution is also the Plancherel measure. 

By the quantum central limit theorem, the state p^^ should 
produce a Gaussian measure on A4 in the limit N oo. So 
should the GUE measure on s[(fc,C) in the limit k ^ oo. 
The relation between these two limits could shed light on 
the Vershik-Kerov limit for Plancherel measure and the 
Wigner semicircle for the spectrum of a random matrix [ |23| ] . 
The quantum central limit theorem might also predict the dis- 
tribution of the deviation from a semicircle, at least to first 
order in N. 
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